Action Recognition using Temporal Bag-of-Words from Depth Maps
نویسندگان
چکیده
In this paper, we present a methodology for human action recognition from a sequence of depth maps obtained using Microsoft Kinect. Specifically, we use a Temporal Bag-of-Words model as representation scheme to capture the variation of features across the temporal domain. Our methodology builds the Temporal Bag-of-Words model on top of the spatiotemporal features extracted from interest points. The local spatio-temporal features provide some invariance to scale, viewpoint changes by capturing the local information. In order to make the representation insensitive to temporal sequence misalignment, we propose using the Temporal Bag-of-Words model in a hierarchical manner by recursively partitioning the depth maps sequence into sub-sequences in temporal domain. Classification is done using SVM. We test our algorithm on our own dataset consisting of eight different actions.
منابع مشابه
Joint Angles Similiarities and HOG for Action Recognition
We propose a set of features derived from skeleton tracking of the human body and depth maps for the purpose of action recognition. The descriptors proposed are easy to implement, produce relatively small-sized feature sets, and the multi-class classification scheme is fast and suitable for real-time applications. We intuitively characterize actions using pairwise affinities between view-invari...
متن کاملJoint Angles Similarities and HOG for Action Recognition
We propose a set of features derived from skeleton tracking of the human body and depth maps for the purpose of action recognition. The descriptors proposed are easy to implement, produce relatively small-sized feature sets, and the multi-class classification scheme is fast and suitable for real-time applications. We intuitively characterize actions using pairwise affinities between view-invari...
متن کاملRobust 3D Action Recognition through Sampling Local Appearances and Global Distributions
3D action recognition has broad applications in human-computer interaction and intelligent surveillance. However, recognizing similar actions remains challenging since previous literature fails to capture motion and shape cues effectively from noisy depth data. In this paper, we propose a novel two-layer Bag-of-Visual-Words (BoVW) model, which suppresses the noise disturbances and jointly encod...
متن کاملStatistics of Pairwise Co-occurring Local Spatio-temporal Features for Human Action Recognition
The bag-of-words approach with local spatio-temporal features have become a popular video representation for action recognition in videos. Together these techniques have demonstrated high recognition results for a number of action classes. Recent approaches have typically focused on capturing global statistics of features. However, existing methods ignore relations between features and thus may...
متن کاملHuman Action Recognition Based on Boosting
Human action recognition is an active research field in computer vision and image processing. In this paper we propose a novel method for the task of recognition of human actions in video image sequences. First of all, a video sequence is represented as a collection of spatial-temporal words by extracting space-time interest points, which is used to characterize action. Then visual words are us...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013